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An ad hoc map evaluation procedure is proposed which is most suitable for 
evaluating low-resolution classification maps against high-resolution 
ground- truth maps, such as Land Satellite maps against interpreted air- 
craft photographs. Commonly practiced sampling and evaluation procedures 
are impracticable in this context because of difficulties in registration 
and in comparing the samples, as experienced in the recent Tri-County 
Pilot Study performed in the Forestry Applications Project. This ad hoc 
procedure is designed to overcome these two major problems, and its 
practicability is discussed. 

Two widely accepted parameters are estimated by the new procedure; 
namely, the probability of correct classification and the proportion 
biases. Statistical qualifications are also provided. 
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PREFACE 


An ad hoc procedure was proposed, discussed, and agreed upon in 
a series of group meetings attended by the followihg personnel 
in the Forestry Applications Project: T. Austin, R. Dillman, 

E. Downes, E. Kan, A. Kerber, C. Reeves, and J. Ward. The suc- 
cesses and failures experienced in the evaluation process per- 
formed in the Tri-County Pilot Study provided useful insight into 
this ad hoc design. The invaluable reviews and comments on this 
paper, contributed by T. Austin and by the Forestry Applications 
Project Scientist, R. W. Douglass, are hereby acknowledged. 
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1 . INTRODUCTION 


Evaluation of map accuracies by sampling and estimating the over- 
all probability of correct classification (PCC) has been discussed 
in theory and practice {ref. 1). This procedure has been demon- 
strated {ref. 2) to be more appropriate for overall map evaluation 
than procedures using training class 'accuracy, average accuracy by 
class, and others which have been commonly practiced in remote 
sensing applications (ref. 3). 

The procedure of sampling and estimating the PCC to evaluate the 
soils resource inventory maps prepared by the Forestry Applications 
Project is documented in reference 4. When interpreted soil maps 
at scales near 1:60 000 were checked against ground samples and 
against U.S. Forest Service base maps at similar scales, the pro- 
cedure proved practicable and practical. (Soils resource maps 
were prepared by interpreting aircraft photography.) In those 
resource maps , landform features were normally large enough or 
wide enough to permit grid cell systems {refs. 1 and 5) with 
sizable grid cell samples [e.g., cell samples of 5 millimeters 
square (0.19 inch square)]. "Sizable" is a relative term, but 
here it is a very important concept because it implies workable, 
practicable, and perhaps practical. 

The same basic procedure was used in the Tri-County Pilot Study 
(TRICPS, ref. 6) but the use of the procedure was frustrating 
and gave PCC estimates much lower than expected. These conclu- 
sions were based on (1) registration inaccuracy (It was almost 
impossible to reliably locate selected samples on the classifi- 
cation map and on the ground-truth map.) and (2) comparison 
inaccuracy [caused by the size of the sample and the majority rule 
in deciding the type ( i . e . , class or feature) to be associated 
with the sample] . 
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Ar> analysis of the experimental design of the evaluation procedure 
used in TRICPS revealed facts that explained the two kinds of 
inaccuracy. The TRICPS developed classif icatidn maps of the first 
Land Satellite (Landsat-1) multispectral scanner data at approxi- 
mately 60 meters (197 feet) resolution (i.e., each picture ele- 
ment (pixel) is 60 meters square (197 feet square) on the 
ground] . These low resolution maps were compared with aircraft 
photography which was interpreted at sample locations and used 
as ground truth. At the 1:120 000 scales, the aircraft photography 
had much higher resolution than the Landsat maps. Knowing that 
it was impossible to use the generic Landsat pixels as sample 
units, a 3- by 3-pixel sample unit was used. The majority rule 
was also employed for assigning a class type on Landsat and 
photography data samples. (Details of the TRICPS evaluation pro- 
cedure can be found in appendix A. ) The evaluation process was 
further complicated by the fact that Landsat classification maps 
are often spotty, and that narrow features such as hardwood stringers 
often gave rise to multiclass samples such that the majority-rule 
decision appeared shaky, although the majority rule is adequate 
for competition between only two classes. 

As a result of the inaccuracies found in the evaluation procedure 
used in TRICPS, an ad hoc map evaluation procedure is proposed for 
cases when the low resolution classification maps are evaluated 
against the high resolution ground- truth maps. This ad hoc pro- 
cedure minimizes registration inaccuracies and comparison inaccu- 
racies. The procedure attempts to evaluate the per-pixel classi- 
fication accuracy of the map by using a sample size which is small 
enough (2 by 2 pixels) to reflect pixel classifications and which 
is large enough to absorb some possible errors caused by misregis- 
tration and mixture pixels. As a result, PCC and proportion 
biases B are estimated. The latter measure of proportion biases 
is actually a byproduct of PCC calculations and is a secondary, 
subopt imally designed measure for the present evaluation design. 
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The biases provide accuracy measures for areal measurements which 
are prime objectives of some investigations. 

Section 2 describes the procedure by using an example, thereby 
providing better insight into the rw relty and motivation of the 
ad hoc design as discussed in section 3. Section 4 gives statis- 
tical qualifications of evaluation parameters and discusses the 
implications of the proportion bias measure and its relation to 
the PCC. Practical considerations in using the ad hoc procedure 
are discussed in section 5. Appendix B also explains the choice 
of the decision rule and its accompanying threshold to determine 
sample correct classification; the decision rule is shown to be 
more discriminatory than the widely used majority rule. 
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2. AD HOC PROCEDURE 

The design of this procedure is based on three assumptions: 

(1) a large Landsat classification map (i.e., over 500 000 pixels) 
is evaluated against interpreted aircraft photography at scales of 
1:60 000 or 1:120 000, (2) sample locations can be located to 
within one landsat pixel (step 2 of the procedure) , and (3) no 
dramatic classification error exists (e.g. , errors as a result 
of consistently classifying forest as nonforest and vice versa) . 


2 .1 PROCEDURE DESCRIPTION 

The natural grid system on the classification map is the pixel 
grid. Assuming a total of K classes (types or feature: 5 is 
represented in the map, including a class of "others," these 
steps should be followed. 

Step 1 — Randomly select M (see section 4.1) primary sampling 

unit (PSU) on the classification map. Each PSU is the 
size of 50 by 50 Landsat pixels. 


Step 2 — Locate the same M PSU’s on the photograph, using dis- 
tinguishing features such as roads, intersections, and 
landmarks. Local registration error should be minimized 
to within 1 Landsat pixel. 


Step 3 — Randomly select 10 secondary sampling units (SSU) within 
each PSU on the classification map; each SSU is the size 
of 2 by 2 Landsat pixels. 

Step 4 — Using the PSU framework on the photograph as determined 
in step 2, locate the same SSU's on the photograph. 


Step 5 — Determine the proportion of each class in each SSU on 
the photograph. Denote by p ^ (where m = 1 , • • • ,M; 
n = 1, • • • ,10 ; and k = 1, • • • ,K) the proportion of the kth 
class interpreted in the nth SSU of the mth PSU. 
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Step 6 - For each of the nine possible locations on or about the 

selected m nth SSU on the Landsat map, determine the pro- 
portions p^ n £, i 88 9, by counting pixels. (See 

section 2. 2 for clarification of this step.) 

Step 7 — For each of the nine locations in each mn th SSU, deter- 

mine the error defined as: 

* £ 

k=l 

Denote the smallest of the nine errors as E_ with the 

mn 

corresponding proportions P mnk f K - 1,* #, /K. 



Step 8 — If E mn $ 0.15 (see appendix B for the determination of 

this threshold) , call this m£h SSU correctly classi- 
fied; otherwise, this math SSU is incorrectly classified. 


Step 9 — Calculate PCC as 


PCC 


Number of correctly classified SSU*s 
Total number of SSU* s 


( 2 ) 


In the present case, the denominator of equation (2) 
equals 1QM. 

Step 10 — Calculate the kth class proportion bias as 

1 M 10 a .. 11/2 

B k “ TOM X) X) ( P mnk "* p mnk) 
m=l n=l 

and the root-mean-square (rms) overall bias as 

1/2 


B 


1 K 9 
X T— \ ^ 2 . 


rms \ K X B k 


(3) 


(4) 


k=l 


Step 11 — Using the PCC as calculated in step 9, calculate the 
confidence interval at specified confidence level 
(section 4.2). If the interval is satisfactory, stop; 
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otherwise, incre$3e M and repeat steps 1 through 11 
(refs. 1 and 7). 

2.2 AN EXAMPLE OF THE AD HOC PROCEDURE 

This subsection gives an example of the execution of steps 5 
through 8 of the procedure. The other steps are self-explanatory. 

Consider the generic nth SSU in the mth PSU, hereafter called 
the SSU, with the m and n indexes dropped, in figure 1, Part a 
of figure 1 shows the location of the SSU on the classification 
map, with an enlargement of the pixel assignment to class 1 or 2 
in the 16 pixels containing the SSU. Part b of figure 1 shows 
the corresponding SSU on ground- truth photographs; the demarcation 
is interpreted to separate feature class 1 on the left from fea- 
ture class 2 on the right. 

Execution of step 5 of the procedure results in the estimates: 

p^ = 0. 7 ? P 2 s 0.3 (5) 

Notice that the indexes m and n are dropped; indexes 1 and 2 
denote the only two classes of interest. 

Step 6 of the procedure requires the examination of each of the 
nine possible locations on or about the SSU (fig. 2). Retaining 
only the indexes for class 1 or 2 and the superscript for the i th 
location, the proportions are estimated and tabulated in table I. 

According to step 7 of the procedure and using the definition of 

(i ) 

E mn ec J uat ^ oh (!) t the errors for the nine possible locations 
are tabulated in the rightmost column of table I. The smallest 
of these errors is 0, occurring at location (b) or (d) with 
p^ - 0.75 and p 2 = 0.25. 
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TABLE I.— TABULATION OF THE DIFFERENCES, E, BETWEEN THE NINE POSSIBLE SSU'S IN 
FIGURE 2 AND THE INTERPRETED SSU ON THE PHOTOGRAPH IN PART A OF FIGURE l a 
















P. - PROPORTION Of CLASS 1*07 
Pj - PROPOR TION OF CLASS 2-03 


Is) 


!b) 


rn ,h PSU ON ClASSIFICA TION MAP 


m ,h PSU ON PHOTOGRAPH 


(a) The ntn SSU in mth PSU (b) The same SSU in the same 

on classification map; PSU on photograph; 

enlarged view of pixel interpretation of SSU 

assignment to class 1 and 2 with assignment to 

in SSU and surrounding class 1 or 2. 

pixels 


Figure 1.— An example of the procedure. 
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Figure 2.— The nine possible locations on or about the SSU of 
figure 1, with pixel assignment to class 1 or 2 . 
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In step 8 , E (E of the nth SSU in the mth PSU) is checked 
e mn 

against the threshold of 0.15. Since E = 0.01 < 0.15, this mn th 
SSU is considered to be correctly classified. 


The proportion biases in this mnth SSU are 

r 2i 1/2 

B mnl 51 L(0.75 - 0.7rJ = 0.05 

and 

1/2 

B mn2 = [(0.25 - 0 . 3 ) 2 ] = 0.05 


(6) 


(7) 



3. NOVELTY OF THE AD HOC PROCEDURE 

As previously stated, the two main objectives of the proposed 
design are: 

a. To minimize registration inaccuracies between a low resolution 
classification map and a high resolution ground- truth map; for 
example, a Landsat classification map versus interpreted air- 
craft photographs at scales of 1:60 000 and 1:120 000. 

b. To minimize comparison inaccuracies, knowing that every 
sampling unit will likely contain more than one class of 
interest; in the present situation, a sample unit is a con- 
glomerate of pixels. 

The first objective is achieved by (1) having large (50 by 50 
pixels) PSU's so that by identifying dominant landmarks within the 
PSU local registration error can be minimized to no more than 1 
pixel within the PSU and (2) considering the nine SSU locations 
(SSU size, is 2 by 2 pixels), on or about the designated location 
as candidates in the classification map, one of which is closest 
to perfect registration with the designated SSU on the ground- 
truth map. 

The second objective is achieved by using error measures consist- 
ing of the difference in proportions of classes defined in equa- 
tion (1) to determine the identity of classified SSU to ground-truth 
designated SSU. This comparison method is more discriminatory than 
using a majority rule to determine the unique SSU classification 
which in turn is used for comparison, as discussed in appendix B. 

The present method uses the PCC concept to attempt to evaluate 
the per-pixel classification. This is possible because SSU's of 
sizes 2 by 2 pixels are small enough to reflect pixel classifica- 
tions and yet large enough to absorb some possible errors caused by 

REPRODUCIBFJTY OF THF 
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misregistration and mixture pixels. It is assumed that no extreme 
classification errors exist? for example, errors caused by consist- 
ently classifying forest as nonforest and vice versa, in which case 
the SSU comparison method by proportion differences will result in 
a totally erroneous PCC. 
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4 . STATISTICAL QUALIFICATIONS 

Three parameters are further discussed, the sample size M, the 
PCC and the corresponding confidence interval, and the proportion 
biases B. 

4.1 SAMPLE SIZE M 

Standard procedures (refs. 1, 2, and 7) provide suitable sample 
sizes M. ^ The basic assumption is that the PCC has a binomial 
distribution. The formulas are 

M = 0.25 (t/AE) 2 (8) 

or M - PCC(1 - PCC) (t/AE) 2 (9) 

according to the availability of an estimated PCC at the begin- 
ning of the evaluation; t is the value in the table of 
t“ distribution corresponding to a prespecified confidence level; 
AE is the allowable error, that is, permitted confidence interval 
half range. 

4.2 CONFIDENCE INTERVALS OF PCC 

Standard procedures (refs. 1, 2, and 7) also provide confidence 
intervals of PCC. The confidence interval half range is 

t/PCC(l - PCC)/M (10) 

where again t is the given value in a table of t-distribution 
which has been assigned a prespecified confidence level. 


^In the ad hoc procedure, M PSU’s with 10 SSU's in each PSU are 
proposed. Effectively, 10 M samples are used for evaluation. 
Thus, 10M should replace M in equations (8), (9), and (10). 
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4.3 PROPORTION BIASES AND IMPLICATIONS 


The proportion biases and the rms value B rmg in equations (3) 
and (4) are widely accepted evaluation parameters; however, they 
are not as easily understood as PCC. Whereas and B rms are 
appropriate measures for proportion accuracies, they are inade- 
quate for measuring map accuracies although a high map accuracy 
{high PGC) is generally accompanied by small biases. Propor- 
tion biases for the entire map or for samples selected from the 
map are used frequently thereby avoiding the problem of registra- 
tion inaccuracies. It is usually assumed that registration errors 
tend to cancel out with large samples. 


To further understand the relationship between proportion biases 
and the PCC, figures 3 and 4 depict how biases and B^ g vary 


with the PCC. 
described, 


[It can be shown that for a two-class case, as 


(1 - 2 q) 


(1 - PCC) ; 


B 2 - 


(*3 - U 


1 - g 


a - 


=vK3). 


In these figures, a two-class map is 


and B - — «- 

rms ? \ 2 

analyzed wherein class 1 h^is a priori probability (i. e. , propor- 
tion in map) q and class 2 has a priori probability 1 - q. Four 
curves are shown in each figure and labeled with the- PCC value. 
For example, the case PCC = 0.9 assumes that the classification 
accuracies of classes 1 and 2 are both 0.9. (Thus, overall 
PCC 5 qp (1/1) + (1 - q) p (2/2 ) = p(l/l) = p(2/2) ? p(i/i) denotes 
class i accuracy.) 


As an example, when class 1 has a priori probability of 0.8 and 
PCC = 0.9, the theoretical biases can be derived from figures 3 
and 4 . 

B 1 = 0.08 


B 2 — 0.3 


( 11 ) 


B =0.22 
rms 
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When q « 0 . 8 and PCC = 0.7, 



B 

rms 


0.23 
0.90 
= 0.66 


( 12 ) 


It can be seen that the relationship between the biases and the 
PCC is very complicated even for the two- class map. No attempt 
has been made here to illustrate the more complicated multiple- 
class cases. The complex relationship between the biases and the 
PCC can be used to conclude that biases B, and B„__ are inadequate 

JC XIUS 

measures for evaluating map accuracies, even though they are 
appropriate for measuring proportion accuracies. 

Note: In the present ad hoc procedure, the proportion bias param- 

eters are only secondary to and a byproduct of the calculation 
of the PCC measure; hence, their estimation is suboptimally designed 
in the present evaluation. The PCC is felt to be a more appro- 
priate measure for map accuracy assessment, as discussed in the 
present context. However, proportion biases can be the prime 
evaluation parameter in investigations where the objective is 
areal measurement. 
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5. PRACTICAL CONSIDERATIONS 


The main practicality factor and practicability consideration 
are the manipulation and registration of classification maps 
(Landsat maps) and ground- truth maps (interpreted aircraft 
photographs) . 

The procedure requires photographic enlargement of Landsat classi 
fication maps on positive prints or transparencies; composite 
classification maps are desired rather than single-class, theme 
prints as produced on the Gould printer of the General Electric 
Interactive Multispectral Image Analysis System (IMAGE 100) . 
Aircraft photography on positive prints or transparencies should 
be enlarged on optical instruments such as the Zoom Transfer 
Scope or the Kargl reflecting projector/rectifier on which 
registration is performed. For easy data handling and for lesser 
geometric distortion, the 1:120 000 or 1:60 000 scale photographs 
are desired. Using such small scales, the amount of stretching 
on the optical instruments during registrations will be minimised 
or even eliminated. Table II summarizes these practical 
considerations. 


TABLE II.- PRACTICAL CONSIDERATIONS WHEN USING THE NEW AD HOC PROCEDURE 
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6 . CONCLUSION 


As a result of previous experience (refs. 1/ 2, 4, and 6), this 
ad hoc procedure was developed to evaluate low resolution classi- 
fication maps (Landsat) against high resolution ground-truth maps 
(interpreted aircraft photographs) . The data offered in this 
document support the value of this new design in (1) minimizing 
registration inaccuracies and (2) minimizing comparison inaccu- 
racies. The ad hoc procedure provides fc the evaluation param- 
eter of the PCC and the byproduct of proportion biases (B) . The 
statistical implications of PCC and B are also discussed. It 
can be concluded that this ad hoc procedure is practical and 
practicable. 

The proposed, novel design attempts to evaluate the per-pixel 
classification by estimating the PCC and by using a sufficiently 
small SSU size (2 by 2 pixels) to reflect pixel classifications 
and yet a sufficiently large SSU size to absorb some possible 
errors due to misregistration and mixture pixels. This ad hoc 
procedure is recommended for reevaluating the classification 
results from the TRICPS. 
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APPENDIX A 


EVALUATION PROCEDURES USED IN TRICPS 

The following is excerpted from the final report of the TRICPS. 
From the entire tricounty site, 100 plots were randomly selected, 
each plot being 10 by 10 pixels. These plots were located on the 
output computer classification maps which were printed by the 
Gould printer, reproducing one feature (versus the remaining) in 
any one print. 

Each of the 10- by 10-pixel plots was subdivided into nine equal 
samples. Thus, each sample is roughly 3 by 3 pixels. The pro- 
portions of each class (softwood, hardwood, mixed, .range, and 
'‘others'*) were counted in each of the 900 (100 x 9) samples. 

Each sample was then classified into its major class. 

Using the Kargl reflecting projector/rectifier, the 100 randomly 
selected plots (900 samples) were located on the available photog 
raphy at a scale of 1:120 000. The photography was interpreted 
and used as ground truth. Each sample was interpreted and classi 
fied by its major type. 

The majority-rule classification of the computer-mapped sample 
was then compared to the majority- rule classification of the 
ground-truth sample. A calculation of accuracy followed, similar 
to the method stated in the text. 

Notes 

1. Local registration error of plots was high. Reliable 

registration of a 10- by 10-pixel plot on the photograph 
was extremely difficult because 

reproducibility of rm 
ORIGINAL FxiGai IS POOR 
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(a) The geographic area covered by a plot was too small. 

Major landmarks in or surrounding the plot could not be 
used effectively to produce good registration. 

(b) Gould printer output maps contained one class versus the 
remainder; using them to register to photographs was very 
difficult because the one-class maps (actually two 
classes, the one being considered and the remainder) 
contained insufficient details to be used for effec- 
tive registration. 

2, Sample classification by majority rule was shaky. The classi- 
fication of an approximate 3- by 3-pixel sample by its 
majority type was too gross because the area covered by a 
sample [approximately 4.45 hectares (11 acres)] was too large 
and normally contained two or more types . Features like 
hardwood were normally narrower than the 3- by 3-pixel sample 
width and, therefore, tended to become misclassif ied when a 
small error existed in the registration. Furthermore, the 
sample comparison did not take advantage of the per-pixel 
classification (see section 3) . 
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APPENDIX B 


ERROR THRESHOLD FOR DECIDING CORRECT SSU CLASSIFICATION 
B.l RECAPITULATION OF THRESHOLDING PROCEDURE 

For each of the nine possible locations on or about the designated 
SSU on classification map (see fig. 2 in text) , the error meas- 
ure E^, i = 1, • • • ,9, is calculated as defined by 



(Bl) 

t 


where p^ and Pj^ denote the proportions of the k th class among 
K classes in the ground-truth SSU and the i th location of the 
classification map SSU, respectively. (The indexes m and n 
corresponding to the PSU and SSU locations are dropped here.) 

The smallest of these E^- , i = !,*••, 9, is designated E; E is 
checked against the recommended threshold 0.15; E > 0.15 means 
that the mapped SSU is incorrectly classified and E < 0.15 means 
that the SSU is correctly classified. 

B. 2 ESTABLISHMENT OF THRESHOLD VALUE 0.15 

Before determining the threshold, it is recognized that even if 
there were no classification error, the residual local registra- 
tion error and the quantization of p^ (in the 25 percentile) will 
still give rise to nonzero E. Thus, a minimal amount of E must 
be allowed, beyond which the SSU is considered to be incorrectly 
classified. This residual error is illustrated by figures Bl and 
B2. Figure Bl is a ground-truth SSU containing classes 1 and 2 
with p^ = 0.7 and ~ 0*3. Assuming that a 1-pixel registration 
error exists between the classification-map SSU and the ground- 
truth SSU, the nine cases in figure B2 are possible configurations 
for the better SSU locations on the classification map; "better" 
means displaced by no more than half a pixel. The minimum E ^ 
will be equal to or less than the closest configuration from 
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CLASS 1 


CLASS 2 



P, ■ PROPORTION OF CLASS I 
IN SSU * 0 7 

P p • PROPORTION OF CLASS 2 
IN SSU * 0 3 


Figure Bl.— Portion of ground-truth map showing location of SSU 
and its interpreted partition into classes 1 and 2. 
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among the nine cases in figure B2. It is easily calculated that 
none of the configurations in figure B2 has exactly zero E. 


Based on the above deduction, a proper threshold can be calculated 
by examining the minimum errors associated with all possible pixel 
assignments to all possible SSU compositions. To do this, two 
assumptions need to be made: 


b. 


The 2- by 2-pixel size of a SSU is small enough that, at 
most, two classes fall within an SSU. 

The proportions p k in a ground-truth SSU are estimated in 
increments of 10 percent. 


Using assumption a , there are only five possible pixel assign- 


ment configurations; i.e., 


0.25, and p^ = 0.0 


(P. 


P-L = 1.0, 

, = 1 ~ P-, 


P 1 = 0.75, p x = 0.50, 
and needs not be further 


stated) . For any assignment with p 1 , the error committed when 
the ground truth has p 1 will be 


E = (Pl - Pi ) 2 + 
° ( P 1 ‘ P l ) 2 + 

= 2 ( P 1 ' Pl) 2 





Take the example of p^ = 0.75, E will be a function of p^ 


(B2) 


E = 2(p x - 0. 75) 2 (B3 ) 

Figure B3 plots all the five cases? i.e., p 1 =0.0, 0.25, 0.50, 
0.75, and 1.0. 


Using figure B3 and assumption b , the following table can be 
prepared (the symmetry of the table for p.^ and below 0.5 makes it 
necessary to analyze only half of the full range of p^) . 
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Figure B3.— Error E between ground-truth SSU configuration 
having p^ and SSU pixel assignment having p^. 
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I 


i 


Pi 

Those p^ such 
that E < 0.15 

Maximum proportion 
error, max \p ± - pj 

1.0 

0.75, 1.0 

0.25 

.9 

. 75 , 1.0 

.15 

.8 

.75, 1.0 

.2 

.7 

.5, .75 

.2 

.6 

.5, .75 

.15 

.5 

.25, .50, .75 

.25 


When a threshold of 0.15 is used, a maximum deviation of 0.15 to 



tolerated, beyond which the SSU is considered incorrectly 
classified. 


Notice that this rule is more discriminatory than the majority 
rule. When is 1.0, 0.9, or 0.8, a pixel configuration having 
P 1 « 0.5 is considered correct classification by the majority 
rule but not by the present error threshold using 0.15. Similarly, 
for p^ « 0.7, 0.6, or 0.5, the pixel configuration with = 1.0 
is considered correct classification by the majority rule but not 
by the threshold using 0.15. Finally, for p^ - 0.5, the pixel 
configuration with p^ = 0. 25 is considered incorrect classifica- 
tion by the majority rule but correct classification using the 
0.15 threshold. A correct classification is more plausible than 
incorrect classification in this case. 

By experimenting with threshold values other than 0.15, similar 
conclusions can be drawn on the utility of the error- thresholding 
method. The value of 0.15 for the threshold, however, seems to 
allow sufficient tolerance in the proportion error between 0.15 
to 0.25 without being overly lenient. An empirical study of a 
few cases of a three-class SSU classification produced the same 
conclusion. 
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